CNN training with graph-based sample preselection: application to handwritten character recognition

نویسندگان

  • Frédéric Rayar
  • Masanori Goto
  • Seiichi Uchida
چکیده

In this paper, we present a study on sample preselection in large training data set for CNN-based classification. To do so, we structure the input data set in a network representation, namely the Relative Neighbourhood Graph, and then extract some vectors of interest. The proposed preselection method is evaluated in the context of handwritten character recognition, by using two data sets, up to several hundred thousands of images. It is shown that the graph-based preselection can reduce the training data set without degrading the recognition accuracy of a non pretrained CNN shallow model. Keywords-Convolutional neural network; Relative neighbourhood graph; Handwritten character recognition; Large data set; Training data set preselection

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Handwritten Gurumukhi Character Recognition Using Convolution Neural Network

Handwritten Character Recognition (HCR) is one of the challenging processes. Automatic recognition of handwritten characters is a difficult task. In this paper, we have presented a scheme for offline handwritten Gurmukhi character recognition based on CNN classifier. The system first prepares a skeleton of the character, so that feature information about the character is extracted. CNN based ap...

متن کامل

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

Using a Synthetic Character Database for Training Deep Learning Models Applied to Offline Handwritten Recognition

We present our current work on building a deep learning architecture for the offline handwritten character recognition problem. The proposed system is based on training a deep Convolutional Neural Network (CNN) to recognize handwritten characters, using a new synthetic character database derived from UNIPEN dataset. The presented approach is inspired in some successfully-used neural architectur...

متن کامل

Handwritten Character Recognition using Modified Gradient Descent Technique of Neural Networks and Representation of Conjugate Descent for Training Patterns

The purpose of this study is to analyze the performance of Back propagation algorithm with changing training patterns and the second momentum term in feed forward neural networks. This analysis is conducted on 250 different words of three small letters from the English alphabet. These words are presented to two vertical segmentation programs which are designed in MATLAB and based on portions (1...

متن کامل

Handwritten Character Recognition Using CNN Gabor-Type Filters

This paper proposes an approach for handwritten character recognition using nonlinear normalisation, a CNN Gabor-Type filter, a Location Based Dominant Orientation Map and cross correlation. Based on a test set of 26 test characters acting as template and a set consisting of 4 sets of 26 unknown handwritten test characters, max. 92 % correct recognition is provided. Recognition rate is studied ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1712.02122  شماره 

صفحات  -

تاریخ انتشار 2017